<HTML>

<HEAD>
<TITLE>Adaptation of a Keyphrase Extractor for Japanese Text</TITLE>
</HEAD>

<BODY BGCOLOR="#ffffff">

<MAP NAME="banner_top">
<AREA SHAPE="rect" COORDS="588,14,620,40" HREF="http://www.iit.nrc.ca/english.html">
<AREA SHAPE="rect" COORDS="538,14,583,37" HREF="http://www.corpserv.nrc.ca/corpserv/nrc.html">
<AREA SHAPE="rect" COORDS="86,4,421,37" HREF="http://www.iit.nrc.ca/II_public/index.html">
</MAP>

<table cellpadding="5" cellspacing="0" border="0" width="100%">
<tr><td valign="bottom" align="left">
<IMG SRC="../banner_top.jpg" width="620" height="37" alt="II Group Banner" 
USEMAP="#banner_top"ISMAP border="0"><BR><IMG SRC="../banner_extractor.jpg" 
width="217" heigth="49" alt="Extractor">
</td></tr>
</table>

<FONT COLOR="#400080">
<H2>Adaptation of a Keyphrase Extractor for Japanese Text</H2>
</FONT>

<HR>

<TABLE>
<TR>
<TD>
<B>Format</B>
</TD>
<TD>
<B>File</B>
</TD>
<TD ALIGN=RIGHT>
<B>Bytes</B>
</TD>
<TD>
<B>Viewer</B>
</TD>
</TR>

<TR>
<TD>
PDF
</TD>
<TD>
<A HREF=CAIS99.pdf><CODE>CAIS99.pdf</CODE></A>
</TD>
<TD ALIGN=RIGHT>
96,117
</TD>
<TD>
<A HREF=http://www.adobe.com/prodindex/acrobat/readstep.html>free PDF viewer</A>
</TD>
</TR>

<TR>
<TD>
PostScript
</TD>
<TD>
<A HREF=CAIS99.ps><CODE>CAIS99.ps</CODE></A>
</TD>
<TD ALIGN=RIGHT>
728,533
</TD>
<TD>
<A HREF=http://www.cs.wisc.edu/~ghost/>free PostScript viewer</A>
</TD>
</TR>

<TR>
<TD>
Compressed PostScript
</TD>
<TD>
<A HREF=CAIS99.ps.Z><CODE>CAIS99.ps.Z</CODE></A>
</TD>
<TD ALIGN=RIGHT>
293,001
</TD>
<TD>
<A HREF=http://www.cs.wisc.edu/~ghost/>free PostScript viewer</A>
</TD>
</TR>
</TABLE>

<P>

<HR>

<DL>
<DT> Mathieu, J. (1999).
<DD> Adaptation of a keyphrase extractor for Japanese text.
<DD> <I>Proceedings of the 27th Annual Conference of the Canadian Association for 
Information Science (CAIS-99)</I>, 
<DD> Sherbrooke, Quebec, pp. 182-189.
</DL>

<HR>

<H4>Abstract</H4>

This paper presents some statistical observations relevant to Japanese 
keyphrase extraction, as well as the details of the implementation of 
a keyphrase extraction algorithm (called Extractor) for Japanese documents.  
Parts of the algorithm include an efficient method of extracting the 
keyphrase candidates, a way to pinpoint the most probable keyphrases 
using contextual information, a technique to find the main ideas conveyed 
in the text, and a way to express those ideas using extracted phrases.  
Finally, a comparison with the English and French versions of Extractor 
will be presented.

<HR>

<CENTER>
<table border="1" bgcolor="#ccccff">
<tr><td><font size=2>
[ <a href="http://extractor.iit.nrc.ca/">Extractor Home</a> |
<A HREF="http://www.iit.nrc.ca/II_public/french.html">Fran&ccedil;ais</a> |
<A HREF="http://www.iit.nrc.ca/english.html">IIT</A> |
<A HREF="http://www.iit.nrc.ca/II_public/index.html">II Group</A> |
<A HREF="http://www.corpserv.nrc.ca/corpserv/nrc.html">NRC</A> |
<A HREF="http://ai.iit.nrc.ca/search.html">Search</A> |
<A HREF="mailto:Peter.Turney@iit.nrc.ca">Feedback</A> ]
[ <I>Updated</I>: July 10, 2000 ]</font size=2></td></tr>
</table>
</CENTER>

</BODY>
</HTML>

